Gradient-Based Label Binning in Multi-label Classification
نویسندگان
چکیده
In multi-label classification, where a single example may be associated with several class labels at the same time, ability to model dependencies between is considered crucial effectively optimize non-decomposable evaluation measures, such as Subset 0/1 loss. The gradient boosting framework provides well-studied foundation for learning models that are specifically tailored loss function and recent research attests achieve high predictive accuracy in setting. utilization of second-order derivatives, used by many approaches, helps guide minimization losses, due information about pairs it incorporates into optimization process. On downside, this comes computational costs, even if number small. work, we address bottleneck approach—the need solve system linear equations—by integrating novel approximation technique procedure. Based on derivatives computed during training, dynamically group predefined bins impose an upper bound dimensionality system. Our experiments, using existing rule-based algorithm, suggest boost speed without any significant performance.
منابع مشابه
Exploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملBoosting-based Multi-label Classification
Multi-label classification is a machine learning task that assumes that a data instance may be assigned with multiple number of class labels at the same time. Modelling of this problem has become an important research topic recently. This paper revokes AdaBoostSeq multi-label classification algorithm and examines it in order to check its robustness properties. It can be stated that AdaBoostSeq ...
متن کاملOn Label Dependence in Multi-Label Classification
The aim of this paper is to elaborate on the important issue of label dependence in multi-label classification (MLC). Looking at the problem from a statistical perspective, we claim that two different types of label dependence should be distinguished, namely conditional and unconditional. We formally explain the differences and connections between both types of dependence and illustrate them by...
متن کاملMulti-Label Classification by Label Clustering based on Covariance
Multi-label classification is a supervised learning problem that predicts multiple labels simultaneously. One of the key challenges in such tasks is modelling the correlations between multiple labels. LaCova is a decision tree multi-label classifier, that interpolates between two baseline methods: Binary Relevance (BR), which assumes all labels independent; and Label Powerset (LP), which learns...
متن کاملMulti Label Text Classification through Label Propagation
Classifying text data has been an active area of research for a long time. Text document is multifaceted object and often inherently ambiguous by nature. Multi-label learning deals with such ambiguous object. Classification of such ambiguous text objects often makes task of classifier difficult while assigning relevant classes to input document. Traditional single label and multi class text cla...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-86523-8_28